Maitri Demonstration: Managing Large Scale Scientific Data (Demo)
نویسندگان
چکیده
Even traditional commercial database systems do not scale to the size of today’s large scientific data sets, whose growth is outpacing Moore’s Law. Instead, scientists are wedded to special-purpose data formats and their associated I/O libraries, even though these libraries provide only basic functionality. Thus there is a need for a scalable data management system that can support these formats and, when needed, provide more sophisticated functionality for indexing, buffering, caching, concurrency control, metadata management, and querying. This demonstration showcases Maitri, a framework that can be used to address these needs. The Maitri framework consists of a set of standard, very narrow interfaces for format-agnostic, loosely-coupled libraries offering aspects of the functionality listed above. Format independence is provided by Maitri’s block manager module, which encapsulates all code that is specific to a particular scientific data format, and calls the appropriate scientific I/O library to read and write data in that format. The demonstration shows the Rocketeer visualization toolkit running with subsets of the Maitri module implementations to visualize rocket simulation output data. The Rocketeer runs show the efficiency that can be gained by layering format-agnostic query, buffering, and/or indexing facilities atop scientific I/O libraries. The demonstration also shows that despite their narrow interfaces, the implementations of Maitri modules work together very effectively.
منابع مشابه
Maitri: A Format-Independent Framework for Managing Large Scale Scientific Data
Even traditional commercial database systems do not scale to the size of today’s large scientific data sets, whose growth is outpacing Moore’s Law. Instead, scientists are wedded to special-purpose data formats and their associated I/O libraries, even though these libraries provide only basic functionality. Thus there is a need for a scalable data management system that can support these format...
متن کاملSyntactic Structures of Sentences from Large Corpora
The demonstration will consist in displaying the syntactic structure of sentences from novels, scientific texts, newspapers, ... The syntactic structures are computed by our syntactic parser and the output shows in a human-friendly graphic interface (1) word features (as computed by POS tagger) (2) nonrecursive phrases (as computed by shallow parser) and (3) their relations (the functional stru...
متن کاملAZDBLab: A Laboratory Information System for Large-Scale Empirical DBMS Studies
In the database field, while very strong mathematical and engineering work has been done, the scientific approach has been much less prominent. The deep understanding of query optimizers obtained through the scientific approach can lead to better engineered designs. Unlike other domains, there have been few DBMS-dedicated laboratories, focusing on such scientific investigation. In this demonstr...
متن کاملMaitri: Format Independent Data Management for Scientific Data
Today’s scientific applications are very data intensive, and their data management requirements can no longer be met by special-purpose libraries for particular scientific data formats or by traditional database management systems. This paper proposes Maitri, a data-format-independent, loosely-coupled, application-tailorable set of libraries that provides a holistic data management framework fo...
متن کاملJob Provenance - Insight into Very Large Provenance Datasets
Following the job-centric monitoring concept, Job Provenance (JP) service organizes provenance records on the per-job basis. It is designed to manage very large number of records, as was required in the EGEE project where it was developed originally. The quantitative aspect is also a focus of the presented demonstration. We show JP capability to retrieve data items of interest from a large data...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007